Speech recognition with a seamlessly updated language model for real-time closed-captioning
Abstract
For online applications such as real-time closed-captioning, it is desirable to update the language model of a speech recognizer consistently and seamlessly, without stopping recognition. This paper proposes a novel speech recognition system that allows the model to be updated at any time, even while the system is running. The system can run a second decoder with the latest model in parallel. An additional job process controls which decoder has priority and switches between them at a non-speech portion of the input; it sends acoustic features only to the active target decoder with the latest model, and sends the recognized words to the backend manual error correction for closed-captioning. The system thus updates the model seamlessly and ensures uninterrupted speech recognition with the latest model at any time. In speech recognition and captioning experiments on Japanese broadcast news programs, our new practical real-time closed-captioning system reduced word errors by two thirds with the proposed language model update mechanism.
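The switching behaviour described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `Decoder`, `JobProcess`, `load_updated_model` and `push` are hypothetical, frames are plain strings rather than acoustic features, the speech/non-speech decision is assumed to arrive as a ready-made boolean, and non-speech frames are simply dropped.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Decoder:
    """Hypothetical stand-in for a decoder bound to one language model."""
    model_version: str

    def decode(self, frame: str) -> str:
        # A real decoder would consume acoustic features; here the frame is
        # only tagged with the model version so the routing is visible.
        return f"{frame}@{self.model_version}"


@dataclass
class JobProcess:
    """Hypothetical router: feeds the active decoder, switches at pauses."""
    active: Decoder
    standby: Optional[Decoder] = None
    output: List[str] = field(default_factory=list)

    def load_updated_model(self, model_version: str) -> None:
        # Start a second decoder with the latest model; the first keeps running.
        self.standby = Decoder(model_version)

    def push(self, frame: str, is_speech: bool) -> None:
        if not is_speech:
            # Switch only at a non-speech portion, so that no utterance
            # is split between the old and the new model.
            if self.standby is not None:
                self.active, self.standby = self.standby, None
            return
        # Frames go only to the active decoder; its words go to the output,
        # which stands in for the backend manual error correction.
        self.output.append(self.active.decode(frame))


job = JobProcess(active=Decoder("lm-v1"))
job.push("a", True)
job.load_updated_model("lm-v2")   # update requested mid-utterance
job.push("b", True)               # still decoded with the old model
job.push("", False)               # pause: the new decoder takes over
job.push("c", True)
print(job.output)                 # ['a@lm-v1', 'b@lm-v1', 'c@lm-v2']
```

In this sketch the update request never interrupts decoding: the old model finishes the current utterance and the new one takes over from the next pause onward.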
Similar resources
Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries
This article describes a real-time speech recognition system for closed-captioning of TV ice-hockey commentaries. Automatic transcription of the TV commentary accompanying an ice-hockey match is usually a hard task because the commentator's spontaneous speech is often set against very loud background noise created by the public, music, sirens, drums, whistles, etc. Data for building this system was c...
Broadcast Technology
Closed captioning to convey the speech of TV programs by text is becoming a useful means of providing information for elderly people and the hearing impaired, and real-time captioning of live programs is expanding yearly thanks to the use of speech recognition technology and special keyboards for high-speed input. This paper describes the current state of closed captioning, provides an overview...
New Real-Time Closed-Captioning System for Japanese Broadcast News Programs
A new real-time closed-captioning system for Japanese broadcast news programs is described. The system is based on a hybrid automatic speech recognition system that switches input speech between the original program sound and the rephrased speech by a "re-speaker". It minimises the number of correction operators, generally to one or two, depending on the difficulties of the speech recognition, ...
A real-time Japanese broadcast news closed-captioning system
This paper describes a collaboration between Bell Labs and NHK (Japan Broadcasting Corp.) STRL to develop a real-time large vocabulary speech recognition system for live closed-captioning of NHK news programs. The Bell Labs broadcast news recognition engine consists of a two-pass decoder using bigram language models (LM) and right biphone models during the first pass, and trigram LM with within-wor...
Real-Time Closed-Captioning Using Speech Recognition
There is a great need for more TV programs to be closed-captioned to help hearing impaired and elderly people watch TV. For that purpose, automatic speech recognition is expected to contribute to providing text from speech in real-time. NHK has been using speech recognition for closed-captioning of some of its news, sports and other live TV programs. In news programs, automatic speech recogniti...